Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 426880 |
| Missing cells | 1215152 |
| Missing cells (%) | 15.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 58.6 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 4 |
|---|---|
| Text | 4 |
| Categorical | 10 |
drive is highly overall correlated with type | High correlation |
odometer is highly overall correlated with year | High correlation |
type is highly overall correlated with drive | High correlation |
year is highly overall correlated with odometer | High correlation |
fuel is highly imbalanced (62.7%) | Imbalance |
title_status is highly imbalanced (89.9%) | Imbalance |
manufacturer has 17646 (4.1%) missing values | Missing |
model has 5277 (1.2%) missing values | Missing |
condition has 174104 (40.8%) missing values | Missing |
cylinders has 177678 (41.6%) missing values | Missing |
odometer has 4400 (1.0%) missing values | Missing |
title_status has 8242 (1.9%) missing values | Missing |
VIN has 161042 (37.7%) missing values | Missing |
drive has 130567 (30.6%) missing values | Missing |
size has 306361 (71.8%) missing values | Missing |
type has 92858 (21.8%) missing values | Missing |
paint_color has 130203 (30.5%) missing values | Missing |
price is highly skewed (γ1 = 254.4069323) | Skewed |
odometer is highly skewed (γ1 = 38.04001486) | Skewed |
id has unique values | Unique |
price has 32895 (7.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-24 11:25:51.546892 |
|---|---|
| Analysis finished | 2024-04-24 11:26:22.860620 |
| Duration | 31.31 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIQUE 
| Distinct | 426880 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3114866 × 109 |
| Minimum | 7.2074081 × 109 |
|---|---|
| Maximum | 7.3171011 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 7.2074081 × 109 |
|---|---|
| 5-th percentile | 7.3031501 × 109 |
| Q1 | 7.3081433 × 109 |
| median | 7.3126208 × 109 |
| Q3 | 7.3152535 × 109 |
| 95-th percentile | 7.3167433 × 109 |
| Maximum | 7.3171011 × 109 |
| Range | 1.0969296 × 108 |
| Interquartile range (IQR) | 7110204.2 |
Descriptive statistics
| Standard deviation | 4473170.4 |
|---|---|
| Coefficient of variation (CV) | 0.0006118004 |
| Kurtosis | 17.057761 |
| Mean | 7.3114866 × 109 |
| Median Absolute Deviation (MAD) | 3096588 |
| Skewness | -1.4301233 |
| Sum | 3.1211274 × 1015 |
| Variance | 2.0009254 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7222695916 | 1 | < 0.1% |
| 7313139418 | 1 | < 0.1% |
| 7313423023 | 1 | < 0.1% |
| 7313423324 | 1 | < 0.1% |
| 7313424533 | 1 | < 0.1% |
| 7313425823 | 1 | < 0.1% |
| 7313426990 | 1 | < 0.1% |
| 7313427132 | 1 | < 0.1% |
| 7313426423 | 1 | < 0.1% |
| 7313426503 | 1 | < 0.1% |
| Other values (426870) | 426870 |
| Value | Count | Frequency (%) |
| 7207408119 | 1 | |
| 7208549803 | 1 | |
| 7209027818 | 1 | |
| 7209054699 | 1 | |
| 7209064557 | 1 | |
| 7210384030 | 1 | |
| 7212512589 | 1 | |
| 7212631321 | 1 | |
| 7213839225 | 1 | |
| 7213843538 | 1 |
| Value | Count | Frequency (%) |
| 7317101084 | 1 | |
| 7317098990 | 1 | |
| 7317098055 | 1 | |
| 7317096748 | 1 | |
| 7317096685 | 1 | |
| 7317096571 | 1 | |
| 7317096373 | 1 | |
| 7317096141 | 1 | |
| 7317096101 | 1 | |
| 7317096069 | 1 |
region
Text
| Distinct | 404 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 20 |
| Mean length | 11.44423 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4885313 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | prescott |
|---|---|
| 2nd row | fayetteville |
| 3rd row | florida keys |
| 4th row | worcester / central MA |
| 5th row | greensboro |
| Value | Count | Frequency (%) |
| 64305 | 8.6% | |
| city | 12302 | 1.6% |
| new | 9171 | 1.2% |
| bay | 8365 | 1.1% |
| st | 7915 | 1.1% |
| san | 7639 | 1.0% |
| south | 7598 | 1.0% |
| county | 6893 | 0.9% |
| jersey | 6781 | 0.9% |
| fort | 6553 | 0.9% |
| Other values (491) | 610100 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4381566 | |
| Space Separator | 320742 | 6.6% |
| Other Punctuation | 78847 | 1.6% |
| Uppercase Letter | 73048 | 1.5% |
| Dash Punctuation | 31094 | 0.6% |
| Open Punctuation | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 476629 | |
| e | 409823 | 9.4% |
| o | 365776 | 8.3% |
| n | 348420 | 8.0% |
| s | 315838 | 7.2% |
| l | 303305 | 6.9% |
| t | 284497 | 6.5% |
| r | 283439 | 6.5% |
| i | 276202 | 6.3% |
| c | 167082 | 3.8% |
| Other values (16) | 1150555 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 10188 | |
| S | 8797 | |
| O | 8467 | |
| M | 7881 | |
| D | 5443 | |
| F | 4427 | 6.1% |
| N | 4048 | 5.5% |
| J | 3662 | 5.0% |
| A | 3590 | 4.9% |
| W | 3253 | 4.5% |
| Other values (12) | 13292 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 66296 | |
| , | 9563 | 12.1% |
| ' | 2988 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 320742 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31094 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4454614 | |
| Common | 430699 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 476629 | 10.7% |
| e | 409823 | 9.2% |
| o | 365776 | 8.2% |
| n | 348420 | 7.8% |
| s | 315838 | 7.1% |
| l | 303305 | 6.8% |
| t | 284497 | 6.4% |
| r | 283439 | 6.4% |
| i | 276202 | 6.2% |
| c | 167082 | 3.8% |
| Other values (38) | 1223603 |
Common
| Value | Count | Frequency (%) |
| 320742 | ||
| / | 66296 | 15.4% |
| - | 31094 | 7.2% |
| , | 9563 | 2.2% |
| ' | 2988 | 0.7% |
| ( | 8 | < 0.1% |
| ) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4885313 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 476629 | 9.8% |
| e | 409823 | 8.4% |
| o | 365776 | 7.5% |
| n | 348420 | 7.1% |
| 320742 | 6.6% | |
| s | 315838 | 6.5% |
| l | 303305 | 6.2% |
| t | 284497 | 5.8% |
| r | 283439 | 5.8% |
| i | 276202 | 5.7% |
| Other values (45) | 1500642 |
price
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 15655 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75199.033 |
| Minimum | 0 |
|---|---|
| Maximum | 3.7369287 × 109 |
| Zeros | 32895 |
| Zeros (%) | 7.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5900 |
| median | 13950 |
| Q3 | 26485.75 |
| 95-th percentile | 44500 |
| Maximum | 3.7369287 × 109 |
| Range | 3.7369287 × 109 |
| Interquartile range (IQR) | 20585.75 |
Descriptive statistics
| Standard deviation | 12182282 |
|---|---|
| Coefficient of variation (CV) | 162.00052 |
| Kurtosis | 69205.089 |
| Mean | 75199.033 |
| Median Absolute Deviation (MAD) | 9450 |
| Skewness | 254.40693 |
| Sum | 3.2100963 × 1010 |
| Variance | 1.48408 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32895 | 7.7% |
| 6995 | 3169 | 0.7% |
| 7995 | 3129 | 0.7% |
| 9995 | 2867 | 0.7% |
| 8995 | 2837 | 0.7% |
| 4500 | 2778 | 0.7% |
| 5995 | 2727 | 0.6% |
| 3500 | 2716 | 0.6% |
| 29990 | 2705 | 0.6% |
| 6500 | 2594 | 0.6% |
| Other values (15645) | 368463 |
| Value | Count | Frequency (%) |
| 0 | 32895 | |
| 1 | 1951 | 0.5% |
| 2 | 13 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 16 | < 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 3736928711 | 2 | < 0.1% |
| 3024942282 | 2 | < 0.1% |
| 3009548743 | 1 | < 0.1% |
| 1410065407 | 1 | < 0.1% |
| 1234567890 | 1 | < 0.1% |
| 1111111111 | 2 | < 0.1% |
| 987654321 | 2 | < 0.1% |
| 135008900 | 1 | < 0.1% |
| 123456789 | 6 | |
| 113456789 | 1 | < 0.1% |
year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 114 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1205 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2011.2352 |
| Minimum | 1900 |
|---|---|
| Maximum | 2022 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1998 |
| Q1 | 2008 |
| median | 2013 |
| Q3 | 2017 |
| 95-th percentile | 2020 |
| Maximum | 2022 |
| Range | 122 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 9.4521196 |
|---|---|
| Coefficient of variation (CV) | 0.004699659 |
| Kurtosis | 19.579889 |
| Mean | 2011.2352 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -3.5779204 |
| Sum | 8.5613254 × 108 |
| Variance | 89.342565 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 36420 | 8.5% |
| 2018 | 36369 | 8.5% |
| 2015 | 31538 | 7.4% |
| 2013 | 30794 | 7.2% |
| 2016 | 30434 | 7.1% |
| 2014 | 30283 | 7.1% |
| 2019 | 25375 | 5.9% |
| 2012 | 23898 | 5.6% |
| 2011 | 20341 | 4.8% |
| 2020 | 19298 | 4.5% |
| Other values (104) | 140925 |
| Value | Count | Frequency (%) |
| 1900 | 12 | |
| 1901 | 3 | < 0.1% |
| 1902 | 1 | < 0.1% |
| 1903 | 12 | |
| 1905 | 1 | < 0.1% |
| 1909 | 1 | < 0.1% |
| 1910 | 2 | < 0.1% |
| 1913 | 2 | < 0.1% |
| 1915 | 1 | < 0.1% |
| 1916 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2022 | 133 | < 0.1% |
| 2021 | 2396 | 0.6% |
| 2020 | 19298 | |
| 2019 | 25375 | |
| 2018 | 36369 | |
| 2017 | 36420 | |
| 2016 | 30434 | |
| 2015 | 31538 | |
| 2014 | 30283 | |
| 2013 | 30794 |
manufacturer
Categorical
MISSING 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17646 |
| Missing (%) | 4.1% |
| Memory size | 3.3 MiB |
| ford | |
|---|---|
| chevrolet | |
| toyota | |
| honda | |
| nissan | 19067 |
| Other values (37) |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 5.7946578 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2371371 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gmc |
|---|---|
| 2nd row | chevrolet |
| 3rd row | chevrolet |
| 4th row | toyota |
| 5th row | ford |
Common Values
| Value | Count | Frequency (%) |
| ford | 70985 | |
| chevrolet | 55064 | |
| toyota | 34202 | 8.0% |
| honda | 21269 | 5.0% |
| nissan | 19067 | 4.5% |
| jeep | 19014 | 4.5% |
| ram | 18342 | 4.3% |
| gmc | 16785 | 3.9% |
| bmw | 14699 | 3.4% |
| dodge | 13707 | 3.2% |
| Other values (32) | 126100 | |
| (Missing) | 17646 | 4.1% |
Length
| Value | Count | Frequency (%) |
| ford | 70985 | |
| chevrolet | 55064 | |
| toyota | 34202 | 8.4% |
| honda | 21269 | 5.2% |
| nissan | 19067 | 4.7% |
| jeep | 19014 | 4.6% |
| ram | 18342 | 4.5% |
| gmc | 16785 | 4.1% |
| bmw | 14699 | 3.6% |
| dodge | 13707 | 3.3% |
| Other values (32) | 126121 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2358459 | |
| Dash Punctuation | 12891 | 0.5% |
| Space Separator | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.2% |
| r | 196161 | 8.3% |
| a | 186064 | 7.9% |
| d | 162166 | 6.9% |
| t | 136711 | 5.8% |
| c | 124158 | 5.3% |
| n | 114989 | 4.9% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (15) | 735670 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12891 |
Space Separator
| Value | Count | Frequency (%) |
| 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2358459 | |
| Common | 12912 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.2% |
| r | 196161 | 8.3% |
| a | 186064 | 7.9% |
| d | 162166 | 6.9% |
| t | 136711 | 5.8% |
| c | 124158 | 5.3% |
| n | 114989 | 4.9% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (15) | 735670 |
Common
| Value | Count | Frequency (%) |
| - | 12891 | |
| 21 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2371371 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 257522 | 10.9% |
| e | 239422 | 10.1% |
| r | 196161 | 8.3% |
| a | 186064 | 7.8% |
| d | 162166 | 6.8% |
| t | 136711 | 5.8% |
| c | 124158 | 5.2% |
| n | 114989 | 4.8% |
| l | 106299 | 4.5% |
| i | 99297 | 4.2% |
| Other values (17) | 748582 |
model
Text
MISSING 
| Distinct | 29649 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 5277 |
| Missing (%) | 1.2% |
| Memory size | 3.3 MiB |
Length
| Max length | 203 |
|---|---|
| Median length | 177 |
| Mean length | 11.91973 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5025394 |
|---|---|
| Distinct characters | 117 |
| Distinct categories | 17 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 7 ? |
Unique
| Unique | 15290 ? |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | sierra 1500 crew cab slt |
|---|---|
| 2nd row | silverado 1500 |
| 3rd row | silverado 1500 crew |
| 4th row | tundra double cab sr |
| 5th row | f-150 xlt |
| Value | Count | Frequency (%) |
| 1500 | 24082 | 2.6% |
| sport | 23261 | 2.6% |
| 4d | 18645 | 2.1% |
| silverado | 17396 | 1.9% |
| sedan | 15508 | 1.7% |
| cab | 15224 | 1.7% |
| f-150 | 10417 | 1.1% |
| 4x4 | 9664 | 1.1% |
| grand | 8913 | 1.0% |
| sierra | 8703 | 1.0% |
| Other values (8692) | 757366 |
Most occurring characters
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (107) | 1982290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3875770 | |
| Decimal Number | 525880 | 10.5% |
| Space Separator | 487722 | 9.7% |
| Uppercase Letter | 64369 | 1.3% |
| Dash Punctuation | 44694 | 0.9% |
| Other Punctuation | 25380 | 0.5% |
| Math Symbol | 434 | < 0.1% |
| Open Punctuation | 371 | < 0.1% |
| Currency Symbol | 365 | < 0.1% |
| Close Punctuation | 333 | < 0.1% |
| Other values (7) | 76 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 395746 | 10.2% |
| a | 371389 | 9.6% |
| r | 366159 | 9.4% |
| s | 278320 | 7.2% |
| t | 259161 | 6.7% |
| i | 237544 | 6.1% |
| o | 228379 | 5.9% |
| l | 212188 | 5.5% |
| c | 206496 | 5.3% |
| n | 190536 | 4.9% |
| Other values (28) | 1129852 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6644 | 10.3% |
| R | 4584 | 7.1% |
| E | 4401 | 6.8% |
| A | 4207 | 6.5% |
| T | 3910 | 6.1% |
| M | 3828 | 5.9% |
| D | 3653 | 5.7% |
| C | 3437 | 5.3% |
| I | 3293 | 5.1% |
| H | 3222 | 5.0% |
| Other values (19) | 23190 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11618 | |
| * | 5141 | |
| / | 4125 | 16.3% |
| & | 1463 | 5.8% |
| , | 1104 | 4.3% |
| ! | 824 | 3.2% |
| " | 661 | 2.6% |
| ' | 267 | 1.1% |
| % | 59 | 0.2% |
| # | 51 | 0.2% |
| Other values (4) | 67 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 167261 | |
| 5 | 113365 | |
| 4 | 66763 | 12.7% |
| 1 | 52087 | 9.9% |
| 2 | 47125 | 9.0% |
| 3 | 45738 | 8.7% |
| 6 | 12642 | 2.4% |
| 7 | 8293 | 1.6% |
| 8 | 8286 | 1.6% |
| 9 | 4320 | 0.8% |
Other Symbol
| Value | Count | Frequency (%) |
| ♿ | 28 | |
|  | 10 | 19.2% |
| 🚙 | 4 | 7.7% |
| ♦ | 4 | 7.7% |
| 🔥 | 3 | 5.8% |
| ® | 2 | 3.8% |
| 🌟 | 1 | 1.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 378 | |
| | | 34 | 7.8% |
| ~ | 17 | 3.9% |
| × | 5 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 44693 | |
| – | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 368 | |
| [ | 3 | 0.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 332 | |
| ] | 1 | 0.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| ” | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 487722 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 365 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 4 |
Format
| Value | Count | Frequency (%) |
| | 4 |
Other Number
| Value | Count | Frequency (%) |
| ⁰ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3940123 | |
| Common | 1085269 | 21.6% |
| Cyrillic | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 487722 | ||
| 0 | 167261 | 15.4% |
| 5 | 113365 | 10.4% |
| 4 | 66763 | 6.2% |
| 1 | 52087 | 4.8% |
| 2 | 47125 | 4.3% |
| 3 | 45738 | 4.2% |
| - | 44693 | 4.1% |
| 6 | 12642 | 1.2% |
| . | 11618 | 1.1% |
| Other values (51) | 36255 | 3.3% |
Latin
| Value | Count | Frequency (%) |
| e | 395746 | 10.0% |
| a | 371389 | 9.4% |
| r | 366159 | 9.3% |
| s | 278320 | 7.1% |
| t | 259161 | 6.6% |
| i | 237544 | 6.0% |
| o | 228379 | 5.8% |
| l | 212188 | 5.4% |
| c | 206496 | 5.2% |
| n | 190536 | 4.8% |
| Other values (44) | 1194205 |
Cyrillic
| Value | Count | Frequency (%) |
| М | 1 | |
| Х | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5025298 | |
| Misc Symbols | 32 | < 0.1% |
| None | 25 | < 0.1% |
| Math Alphanum | 14 | < 0.1% |
| Punctuation | 13 | < 0.1% |
| Specials | 10 | < 0.1% |
| Cyrillic | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 487722 | 9.7% | |
| e | 395746 | 7.9% |
| a | 371389 | 7.4% |
| r | 366159 | 7.3% |
| s | 278320 | 5.5% |
| t | 259161 | 5.2% |
| i | 237544 | 4.7% |
| o | 228379 | 4.5% |
| l | 212188 | 4.2% |
| c | 206496 | 4.1% |
| Other values (78) | 1982194 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♿ | 28 | |
| ♦ | 4 | 12.5% |
Specials
| Value | Count | Frequency (%) |
|  | 10 |
None
| Value | Count | Frequency (%) |
| é | 8 | |
| × | 5 | |
| 🚙 | 4 | |
| 🔥 | 3 | 12.0% |
| ® | 2 | 8.0% |
| ⁰ | 1 | 4.0% |
| ó | 1 | 4.0% |
| 🌟 | 1 | 4.0% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| | 4 | |
| – | 1 | 7.7% |
| ” | 1 | 7.7% |
| “ | 1 | 7.7% |
Math Alphanum
| Value | Count | Frequency (%) |
| 𝓮 | 4 | |
| 𝓵 | 1 | 7.1% |
| 𝓼 | 1 | 7.1% |
| 𝓶 | 1 | 7.1% |
| 𝔃 | 1 | 7.1% |
| 𝓷 | 1 | 7.1% |
| 𝓫 | 1 | 7.1% |
| 𝓭 | 1 | 7.1% |
| 𝓬 | 1 | 7.1% |
| 𝓻 | 1 | 7.1% |
Cyrillic
| Value | Count | Frequency (%) |
| М | 1 | |
| Х | 1 |
condition
Categorical
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 174104 |
| Missing (%) | 40.8% |
| Memory size | 3.3 MiB |
| good | |
|---|---|
| excellent | |
| like new | |
| fair | 6769 |
| new | 1305 |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 6.3441506 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1603649 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | good |
|---|---|
| 2nd row | good |
| 3rd row | good |
| 4th row | good |
| 5th row | excellent |
Common Values
| Value | Count | Frequency (%) |
| good | 121456 | |
| excellent | 101467 | |
| like new | 21178 | 5.0% |
| fair | 6769 | 1.6% |
| new | 1305 | 0.3% |
| salvage | 601 | 0.1% |
| (Missing) | 174104 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 121456 | |
| excellent | 101467 | |
| new | 22483 | 8.2% |
| like | 21178 | 7.7% |
| fair | 6769 | 2.5% |
| salvage | 601 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1582471 | |
| Space Separator | 21178 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.8% |
| g | 122057 | 7.7% |
| d | 121456 | 7.7% |
| x | 101467 | 6.4% |
| c | 101467 | 6.4% |
| t | 101467 | 6.4% |
| i | 27947 | 1.8% |
| Other values (7) | 66372 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 21178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1582471 | |
| Common | 21178 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.8% |
| g | 122057 | 7.7% |
| d | 121456 | 7.7% |
| x | 101467 | 6.4% |
| c | 101467 | 6.4% |
| t | 101467 | 6.4% |
| i | 27947 | 1.8% |
| Other values (7) | 66372 | 4.2% |
Common
| Value | Count | Frequency (%) |
| 21178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1603649 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 348663 | |
| o | 242912 | |
| l | 224713 | |
| n | 123950 | 7.7% |
| g | 122057 | 7.6% |
| d | 121456 | 7.6% |
| x | 101467 | 6.3% |
| c | 101467 | 6.3% |
| t | 101467 | 6.3% |
| i | 27947 | 1.7% |
| Other values (8) | 87550 | 5.5% |
cylinders
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 177678 |
| Missing (%) | 41.6% |
| Memory size | 3.3 MiB |
| 6 cylinders | |
|---|---|
| 4 cylinders | |
| 8 cylinders | |
| 5 cylinders | 1712 |
| 10 cylinders | 1455 |
| Other values (3) | 2162 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.975426 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2735098 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 8 cylinders |
|---|---|
| 2nd row | 8 cylinders |
| 3rd row | 8 cylinders |
| 4th row | 8 cylinders |
| 5th row | 6 cylinders |
Common Values
| Value | Count | Frequency (%) |
| 6 cylinders | 94169 | |
| 4 cylinders | 77642 | |
| 8 cylinders | 72062 | |
| 5 cylinders | 1712 | 0.4% |
| 10 cylinders | 1455 | 0.3% |
| other | 1298 | 0.3% |
| 3 cylinders | 655 | 0.2% |
| 12 cylinders | 209 | < 0.1% |
| (Missing) | 177678 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cylinders | 247904 | |
| 6 | 94169 | 18.9% |
| 4 | 77642 | 15.6% |
| 8 | 72062 | 14.5% |
| 5 | 1712 | 0.3% |
| 10 | 1455 | 0.3% |
| other | 1298 | 0.3% |
| 3 | 655 | 0.1% |
| 12 | 209 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2237626 | |
| Decimal Number | 249568 | 9.1% |
| Space Separator | 247904 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| o | 1298 | 0.1% |
| Other values (2) | 2596 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 94169 | |
| 4 | 77642 | |
| 8 | 72062 | |
| 5 | 1712 | 0.7% |
| 1 | 1664 | 0.7% |
| 0 | 1455 | 0.6% |
| 3 | 655 | 0.3% |
| 2 | 209 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 247904 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2237626 | |
| Common | 497472 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| o | 1298 | 0.1% |
| Other values (2) | 2596 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 247904 | ||
| 6 | 94169 | 18.9% |
| 4 | 77642 | 15.6% |
| 8 | 72062 | 14.5% |
| 5 | 1712 | 0.3% |
| 1 | 1664 | 0.3% |
| 0 | 1455 | 0.3% |
| 3 | 655 | 0.1% |
| 2 | 209 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2735098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 249202 | |
| r | 249202 | |
| s | 247904 | |
| 247904 | ||
| c | 247904 | |
| y | 247904 | |
| l | 247904 | |
| i | 247904 | |
| n | 247904 | |
| d | 247904 | |
| Other values (11) | 253462 |
fuel
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3013 |
| Missing (%) | 0.7% |
| Memory size | 3.3 MiB |
| gas | |
|---|---|
| other | 30728 |
| diesel | 30062 |
| hybrid | 5170 |
| electric | 1698 |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.41438 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1447243 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | gas |
|---|---|
| 2nd row | gas |
| 3rd row | gas |
| 4th row | gas |
| 5th row | gas |
Common Values
| Value | Count | Frequency (%) |
| gas | 356209 | |
| other | 30728 | 7.2% |
| diesel | 30062 | 7.0% |
| hybrid | 5170 | 1.2% |
| electric | 1698 | 0.4% |
| (Missing) | 3013 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| gas | 356209 | |
| other | 30728 | 7.2% |
| diesel | 30062 | 7.1% |
| hybrid | 5170 | 1.2% |
| electric | 1698 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1447243 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1447243 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1447243 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 386271 | |
| g | 356209 | |
| a | 356209 | |
| e | 94248 | 6.5% |
| r | 37596 | 2.6% |
| i | 36930 | 2.6% |
| h | 35898 | 2.5% |
| d | 35232 | 2.4% |
| t | 32426 | 2.2% |
| l | 31760 | 2.2% |
| Other values (4) | 44464 | 3.1% |
odometer
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 104870 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 4400 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98043.331 |
| Minimum | 0 |
|---|---|
| Maximum | 10000000 |
| Zeros | 1965 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6318 |
| Q1 | 37704 |
| median | 85548 |
| Q3 | 133542.5 |
| 95-th percentile | 204000 |
| Maximum | 10000000 |
| Range | 10000000 |
| Interquartile range (IQR) | 95838.5 |
Descriptive statistics
| Standard deviation | 213881.5 |
|---|---|
| Coefficient of variation (CV) | 2.1814997 |
| Kurtosis | 1690.7574 |
| Mean | 98043.331 |
| Median Absolute Deviation (MAD) | 47910.5 |
| Skewness | 38.040015 |
| Sum | 4.1421347 × 1010 |
| Variance | 4.5745296 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 2263 | 0.5% |
| 1 | 2246 | 0.5% |
| 0 | 1965 | 0.5% |
| 200000 | 1728 | 0.4% |
| 150000 | 1603 | 0.4% |
| 160000 | 1250 | 0.3% |
| 140000 | 1244 | 0.3% |
| 130000 | 1204 | 0.3% |
| 120000 | 1199 | 0.3% |
| 180000 | 1062 | 0.2% |
| Other values (104860) | 406716 | |
| (Missing) | 4400 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 1965 | |
| 1 | 2246 | |
| 2 | 153 | < 0.1% |
| 3 | 58 | < 0.1% |
| 4 | 138 | < 0.1% |
| 5 | 193 | < 0.1% |
| 6 | 33 | < 0.1% |
| 7 | 69 | < 0.1% |
| 8 | 37 | < 0.1% |
| 9 | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 10000000 | 50 | |
| 9999999 | 88 | |
| 9876543 | 1 | < 0.1% |
| 9750924 | 1 | < 0.1% |
| 9099999 | 1 | < 0.1% |
| 9000000 | 3 | < 0.1% |
| 8888888 | 4 | < 0.1% |
| 8765548 | 1 | < 0.1% |
| 8675309 | 1 | < 0.1% |
| 8393929 | 1 | < 0.1% |
title_status
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8242 |
| Missing (%) | 1.9% |
| Memory size | 3.3 MiB |
| clean | |
|---|---|
| rebuilt | 7219 |
| salvage | 3868 |
| lien | 1422 |
| missing | 814 |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.0558239 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2116560 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | clean |
|---|---|
| 2nd row | clean |
| 3rd row | clean |
| 4th row | clean |
| 5th row | clean |
Common Values
| Value | Count | Frequency (%) |
| clean | 405117 | |
| rebuilt | 7219 | 1.7% |
| salvage | 3868 | 0.9% |
| lien | 1422 | 0.3% |
| missing | 814 | 0.2% |
| parts only | 198 | < 0.1% |
| (Missing) | 8242 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| clean | 405117 | |
| rebuilt | 7219 | 1.7% |
| salvage | 3868 | 0.9% |
| lien | 1422 | 0.3% |
| missing | 814 | 0.2% |
| parts | 198 | < 0.1% |
| only | 198 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2116362 | |
| Space Separator | 198 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (7) | 15652 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2116362 | |
| Common | 198 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (7) | 15652 | 0.7% |
Common
| Value | Count | Frequency (%) |
| 198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2116560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 417824 | |
| e | 417626 | |
| a | 413051 | |
| n | 407551 | |
| c | 405117 | |
| i | 10269 | 0.5% |
| t | 7417 | 0.4% |
| r | 7417 | 0.4% |
| u | 7219 | 0.3% |
| b | 7219 | 0.3% |
| Other values (8) | 15850 | 0.7% |
transmission
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2556 |
| Missing (%) | 0.6% |
| Memory size | 3.3 MiB |
| automatic | |
|---|---|
| other | |
| manual | 25118 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.2315259 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3492834 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | other |
|---|---|
| 2nd row | other |
| 3rd row | other |
| 4th row | other |
| 5th row | automatic |
Common Values
| Value | Count | Frequency (%) |
| automatic | 336524 | |
| other | 62682 | 14.7% |
| manual | 25118 | 5.9% |
| (Missing) | 2556 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| automatic | 336524 | |
| other | 62682 | 14.8% |
| manual | 25118 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3492834 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3492834 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3492834 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 735730 | |
| a | 723284 | |
| o | 399206 | |
| u | 361642 | |
| m | 361642 | |
| i | 336524 | |
| c | 336524 | |
| h | 62682 | 1.8% |
| e | 62682 | 1.8% |
| r | 62682 | 1.8% |
| Other values (2) | 50236 | 1.4% |
VIN
Text
MISSING 
| Distinct | 118246 |
|---|---|
| Distinct (%) | 44.5% |
| Missing | 161042 |
| Missing (%) | 37.7% |
| Memory size | 3.3 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 17 |
| Mean length | 16.958757 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4508282 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 77966 ? |
|---|---|
| Unique (%) | 29.3% |
Sample
| 1st row | 3GTP1VEC4EG551563 |
|---|---|
| 2nd row | 1GCSCSE06AZ123805 |
| 3rd row | 3GCPWCED5LG130317 |
| 4th row | 5TFRM5F17HX120972 |
| 5th row | 1GT220CG8CZ231238 |
| Value | Count | Frequency (%) |
| 1fmju1jt1hea52352 | 261 | 0.1% |
| 3c6jr6dt3kg560649 | 235 | 0.1% |
| 1fter1eh1lla36301 | 231 | 0.1% |
| 5tftx4cn3ex042751 | 227 | 0.1% |
| 1gchtce37g1186784 | 214 | 0.1% |
| 1gtn1teh5ez273019 | 207 | 0.1% |
| 3vwf17at1fm655022 | 199 | 0.1% |
| jn1az4eh8km420880 | 198 | 0.1% |
| 1ftmf1cp3gkd62143 | 195 | 0.1% |
| 1gtr1we07dz143724 | 194 | 0.1% |
| Other values (118236) | 263677 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2538506 | |
| Uppercase Letter | 1969187 | |
| Math Symbol | 295 | < 0.1% |
| Other Punctuation | 294 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 165210 | 8.4% |
| C | 160073 | 8.1% |
| G | 147356 | 7.5% |
| A | 143057 | 7.3% |
| E | 118518 | 6.0% |
| J | 113114 | 5.7% |
| B | 108624 | 5.5% |
| D | 104251 | 5.3% |
| K | 97569 | 5.0% |
| T | 95123 | 4.8% |
| Other values (16) | 716292 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 425101 | |
| 2 | 279570 | |
| 3 | 273707 | |
| 5 | 273040 | |
| 4 | 241769 | |
| 0 | 241757 | |
| 6 | 223745 | |
| 7 | 209685 | |
| 8 | 198070 | |
| 9 | 172062 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 295 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 294 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2539095 | |
| Latin | 1969187 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 165210 | 8.4% |
| C | 160073 | 8.1% |
| G | 147356 | 7.5% |
| A | 143057 | 7.3% |
| E | 118518 | 6.0% |
| J | 113114 | 5.7% |
| B | 108624 | 5.5% |
| D | 104251 | 5.3% |
| K | 97569 | 5.0% |
| T | 95123 | 4.8% |
| Other values (16) | 716292 |
Common
| Value | Count | Frequency (%) |
| 1 | 425101 | |
| 2 | 279570 | |
| 3 | 273707 | |
| 5 | 273040 | |
| 4 | 241769 | |
| 0 | 241757 | |
| 6 | 223745 | |
| 7 | 209685 | |
| 8 | 198070 | |
| 9 | 172062 | |
| Other values (2) | 589 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4508282 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 425101 | 9.4% |
| 2 | 279570 | 6.2% |
| 3 | 273707 | 6.1% |
| 5 | 273040 | 6.1% |
| 4 | 241769 | 5.4% |
| 0 | 241757 | 5.4% |
| 6 | 223745 | 5.0% |
| 7 | 209685 | 4.7% |
| 8 | 198070 | 4.4% |
| 9 | 172062 | 3.8% |
| Other values (28) | 1969776 |
drive
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130567 |
| Missing (%) | 30.6% |
| Memory size | 3.3 MiB |
| 4wd | |
|---|---|
| fwd | |
| rwd |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 888939 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | rwd |
|---|---|
| 2nd row | 4wd |
| 3rd row | 4wd |
| 4th row | 4wd |
| 5th row | 4wd |
Common Values
| Value | Count | Frequency (%) |
| 4wd | 131904 | |
| fwd | 105517 | |
| rwd | 58892 | |
| (Missing) | 130567 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4wd | 131904 | |
| fwd | 105517 | |
| rwd | 58892 |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 757035 | |
| Decimal Number | 131904 | 14.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| f | 105517 | 13.9% |
| r | 58892 | 7.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 131904 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 757035 | |
| Common | 131904 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| f | 105517 | 13.9% |
| r | 58892 | 7.8% |
Common
| Value | Count | Frequency (%) |
| 4 | 131904 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 888939 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| w | 296313 | |
| d | 296313 | |
| 4 | 131904 | |
| f | 105517 | 11.9% |
| r | 58892 | 6.6% |
size
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 306361 |
| Missing (%) | 71.8% |
| Memory size | 3.3 MiB |
| full-size | |
|---|---|
| mid-size | |
| compact | |
| sub-compact | 3194 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.4452659 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1017815 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | full-size |
|---|---|
| 2nd row | full-size |
| 3rd row | full-size |
| 4th row | full-size |
| 5th row | full-size |
Common Values
| Value | Count | Frequency (%) |
| full-size | 63465 | 14.9% |
| mid-size | 34476 | 8.1% |
| compact | 19384 | 4.5% |
| sub-compact | 3194 | 0.7% |
| (Missing) | 306361 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| full-size | 63465 | |
| mid-size | 34476 | |
| compact | 19384 | 16.1% |
| sub-compact | 3194 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 916680 | |
| Dash Punctuation | 101135 | 9.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.9% |
| d | 34476 | 3.8% |
| Other values (5) | 93506 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 101135 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 916680 | |
| Common | 101135 | 9.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.9% |
| d | 34476 | 3.8% |
| Other values (5) | 93506 |
Common
| Value | Count | Frequency (%) |
| - | 101135 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1017815 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 132417 | |
| l | 126930 | |
| - | 101135 | |
| s | 101135 | |
| z | 97941 | |
| e | 97941 | |
| u | 66659 | |
| f | 63465 | |
| m | 57054 | |
| c | 45156 | 4.4% |
| Other values (6) | 127982 |
type
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92858 |
| Missing (%) | 21.8% |
| Memory size | 3.3 MiB |
| sedan | |
|---|---|
| SUV | |
| pickup | |
| truck | |
| other | |
| Other values (8) |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 4.9978534 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1669393 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | pickup |
|---|---|
| 2nd row | pickup |
| 3rd row | pickup |
| 4th row | pickup |
| 5th row | truck |
Common Values
| Value | Count | Frequency (%) |
| sedan | 87056 | |
| SUV | 77284 | |
| pickup | 43510 | |
| truck | 35279 | 8.3% |
| other | 22110 | 5.2% |
| coupe | 19204 | 4.5% |
| hatchback | 16598 | 3.9% |
| wagon | 10751 | 2.5% |
| van | 8548 | 2.0% |
| convertible | 7731 | 1.8% |
| Other values (3) | 5951 | 1.4% |
| (Missing) | 92858 |
Length
| Value | Count | Frequency (%) |
| sedan | 87056 | |
| suv | 77284 | |
| pickup | 43510 | |
| truck | 35279 | |
| other | 22110 | 6.6% |
| coupe | 19204 | 5.7% |
| hatchback | 16598 | 5.0% |
| wagon | 10751 | 3.2% |
| van | 8548 | 2.6% |
| convertible | 7731 | 2.3% |
| Other values (3) | 5951 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1432716 | |
| Uppercase Letter | 231852 | 13.9% |
| Dash Punctuation | 4825 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 144985 | |
| e | 143832 | |
| c | 138920 | |
| n | 123736 | 8.6% |
| p | 106224 | 7.4% |
| u | 98510 | 6.9% |
| k | 95387 | 6.7% |
| d | 87665 | 6.1% |
| s | 87573 | 6.1% |
| t | 81718 | 5.7% |
| Other values (11) | 324166 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 77284 | |
| U | 77284 | |
| S | 77284 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1664568 | |
| Common | 4825 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.3% |
| t | 81718 | 4.9% |
| Other values (14) | 556018 |
Common
| Value | Count | Frequency (%) |
| - | 4825 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1669393 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 144985 | 8.7% |
| e | 143832 | 8.6% |
| c | 138920 | 8.3% |
| n | 123736 | 7.4% |
| p | 106224 | 6.4% |
| u | 98510 | 5.9% |
| k | 95387 | 5.7% |
| d | 87665 | 5.3% |
| s | 87573 | 5.2% |
| t | 81718 | 4.9% |
| Other values (15) | 560843 |
paint_color
Categorical
MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130203 |
| Missing (%) | 30.5% |
| Memory size | 3.3 MiB |
| white | |
|---|---|
| black | |
| silver | |
| blue | |
| red | |
| Other values (7) |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.7906747 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1421283 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | white |
|---|---|
| 2nd row | blue |
| 3rd row | red |
| 4th row | red |
| 5th row | black |
Common Values
| Value | Count | Frequency (%) |
| white | 79285 | |
| black | 62861 | |
| silver | 42970 | 10.1% |
| blue | 31223 | 7.3% |
| red | 30473 | 7.1% |
| grey | 24416 | 5.7% |
| green | 7343 | 1.7% |
| custom | 6700 | 1.6% |
| brown | 6593 | 1.5% |
| yellow | 2142 | 0.5% |
| Other values (2) | 2671 | 0.6% |
| (Missing) | 130203 |
Length
| Value | Count | Frequency (%) |
| white | 79285 | |
| black | 62861 | |
| silver | 42970 | |
| blue | 31223 | 10.5% |
| red | 30473 | 10.3% |
| grey | 24416 | 8.2% |
| green | 7343 | 2.5% |
| custom | 6700 | 2.3% |
| brown | 6593 | 2.2% |
| yellow | 2142 | 0.7% |
| Other values (2) | 2671 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1421283 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1421283 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1421283 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 227866 | |
| l | 142025 | |
| i | 122255 | 8.6% |
| r | 114466 | 8.1% |
| b | 100677 | 7.1% |
| w | 88020 | 6.2% |
| t | 85985 | 6.0% |
| h | 79285 | 5.6% |
| c | 69561 | 4.9% |
| a | 64845 | 4.6% |
| Other values (11) | 326298 |
state
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 853760 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | az |
|---|---|
| 2nd row | ar |
| 3rd row | fl |
| 4th row | ma |
| 5th row | nc |
| Value | Count | Frequency (%) |
| ca | 50614 | 11.9% |
| fl | 28511 | 6.7% |
| tx | 22945 | 5.4% |
| ny | 19386 | 4.5% |
| oh | 17696 | 4.1% |
| or | 17104 | 4.0% |
| mi | 16900 | 4.0% |
| nc | 15277 | 3.6% |
| wa | 13861 | 3.2% |
| pa | 13753 | 3.2% |
| Other values (41) | 210833 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 853760 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 853760 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 853760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 137111 | |
| c | 91464 | |
| n | 80937 | 9.5% |
| i | 67266 | 7.9% |
| o | 56973 | 6.7% |
| m | 56562 | 6.6% |
| t | 49156 | 5.8% |
| l | 47049 | 5.5% |
| f | 28511 | 3.3% |
| w | 26921 | 3.2% |
| Other values (14) | 211810 |
| condition | cylinders | drive | fuel | id | manufacturer | odometer | paint_color | price | size | title_status | transmission | type | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| condition | 1.000 | 0.079 | 0.099 | 0.154 | -0.027 | 0.083 | -0.229 | 0.071 | 0.184 | 0.038 | 0.135 | 0.383 | 0.141 | 0.175 |
| cylinders | 0.079 | 1.000 | 0.386 | 0.198 | -0.031 | 0.338 | 0.017 | 0.074 | 0.265 | 0.315 | 0.040 | 0.156 | 0.244 | -0.108 |
| drive | 0.099 | 0.386 | 1.000 | 0.162 | 0.016 | 0.459 | -0.094 | 0.120 | -0.109 | 0.225 | 0.037 | 0.113 | 0.548 | -0.096 |
| fuel | 0.154 | 0.198 | 0.162 | 1.000 | -0.017 | 0.356 | -0.173 | 0.090 | -0.000 | 0.145 | 0.025 | 0.254 | 0.243 | 0.140 |
| id | -0.027 | -0.031 | 0.016 | -0.017 | 1.000 | 0.086 | 0.045 | 0.026 | -0.079 | 0.007 | 0.013 | 0.046 | 0.050 | -0.085 |
| manufacturer | 0.083 | 0.338 | 0.459 | 0.356 | 0.086 | 1.000 | 0.032 | 0.100 | -0.051 | 0.257 | 0.037 | 0.198 | 0.266 | -0.000 |
| odometer | -0.229 | 0.017 | -0.094 | -0.173 | 0.045 | 0.032 | 1.000 | 0.009 | -0.457 | 0.004 | 0.031 | 0.024 | 0.008 | -0.651 |
| paint_color | 0.071 | 0.074 | 0.120 | 0.090 | 0.026 | 0.100 | 0.009 | 1.000 | 0.028 | 0.080 | 0.023 | 0.134 | 0.094 | 0.010 |
| price | 0.184 | 0.265 | -0.109 | -0.000 | -0.079 | -0.051 | -0.457 | 0.028 | 1.000 | 0.000 | 0.000 | 0.007 | 0.000 | 0.491 |
| size | 0.038 | 0.315 | 0.225 | 0.145 | 0.007 | 0.257 | 0.004 | 0.080 | 0.000 | 1.000 | 0.021 | 0.133 | 0.333 | -0.010 |
| title_status | 0.135 | 0.040 | 0.037 | 0.025 | 0.013 | 0.037 | 0.031 | 0.023 | 0.000 | 0.021 | 1.000 | 0.061 | 0.031 | -0.027 |
| transmission | 0.383 | 0.156 | 0.113 | 0.254 | 0.046 | 0.198 | 0.024 | 0.134 | 0.007 | 0.133 | 0.061 | 1.000 | 0.284 | 0.223 |
| type | 0.141 | 0.244 | 0.548 | 0.243 | 0.050 | 0.266 | 0.008 | 0.094 | 0.000 | 0.333 | 0.031 | 0.284 | 1.000 | -0.002 |
| year | 0.175 | -0.108 | -0.096 | 0.140 | -0.085 | -0.000 | -0.651 | 0.010 | 0.491 | -0.010 | -0.027 | 0.223 | -0.002 | 1.000 |
| id | region | price | year | manufacturer | model | condition | cylinders | fuel | odometer | title_status | transmission | VIN | drive | size | type | paint_color | state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 7222695916 | prescott | 6000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | az |
| 1 | 7218891961 | fayetteville | 11900 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ar |
| 2 | 7221797935 | florida keys | 21000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | fl |
| 3 | 7222270760 | worcester / central MA | 1500 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ma |
| 4 | 7210384030 | greensboro | 4900 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | nc |
| 5 | 7222379453 | hudson valley | 1600 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 6 | 7221952215 | hudson valley | 1000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 7 | 7220195662 | hudson valley | 15995 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ny |
| 8 | 7209064557 | medford-ashland | 5000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | or |
| 9 | 7219485069 | erie | 3000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | pa |
| id | region | price | year | manufacturer | model | condition | cylinders | fuel | odometer | title_status | transmission | VIN | drive | size | type | paint_color | state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 426870 | 7301592119 | wyoming | 22990 | 2020.0 | hyundai | sonata se sedan 4d | good | NaN | gas | 3066.0 | clean | other | 5NPEG4JAXLH051710 | fwd | NaN | sedan | blue | wy |
| 426871 | 7301591639 | wyoming | 17990 | 2018.0 | kia | sportage lx sport utility 4d | good | NaN | gas | 34239.0 | clean | other | KNDPMCAC7J7417329 | NaN | NaN | SUV | NaN | wy |
| 426872 | 7301591201 | wyoming | 32590 | 2020.0 | mercedes-benz | c-class c 300 | good | NaN | gas | 19059.0 | clean | other | 55SWF8DB6LU325050 | rwd | NaN | sedan | white | wy |
| 426873 | 7301591202 | wyoming | 30990 | 2018.0 | mercedes-benz | glc 300 sport | good | NaN | gas | 15080.0 | clean | automatic | WDC0G4JB6JV019749 | rwd | NaN | other | white | wy |
| 426874 | 7301591199 | wyoming | 33590 | 2018.0 | lexus | gs 350 sedan 4d | good | 6 cylinders | gas | 30814.0 | clean | automatic | JTHBZ1BLXJA012999 | rwd | NaN | sedan | white | wy |
| 426875 | 7301591192 | wyoming | 23590 | 2019.0 | nissan | maxima s sedan 4d | good | 6 cylinders | gas | 32226.0 | clean | other | 1N4AA6AV6KC367801 | fwd | NaN | sedan | NaN | wy |
| 426876 | 7301591187 | wyoming | 30590 | 2020.0 | volvo | s60 t5 momentum sedan 4d | good | NaN | gas | 12029.0 | clean | other | 7JR102FKXLG042696 | fwd | NaN | sedan | red | wy |
| 426877 | 7301591147 | wyoming | 34990 | 2020.0 | cadillac | xt4 sport suv 4d | good | NaN | diesel | 4174.0 | clean | other | 1GYFZFR46LF088296 | NaN | NaN | hatchback | white | wy |
| 426878 | 7301591140 | wyoming | 28990 | 2018.0 | lexus | es 350 sedan 4d | good | 6 cylinders | gas | 30112.0 | clean | other | 58ABK1GG4JU103853 | fwd | NaN | sedan | silver | wy |
| 426879 | 7301591129 | wyoming | 30590 | 2019.0 | bmw | 4 series 430i gran coupe | good | NaN | gas | 22716.0 | clean | other | WBA4J1C58KBM14708 | rwd | NaN | coupe | NaN | wy |